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PROTEIN WHICH INTERACTS WITH THE HUNTINGTON'S DISEASE GENE 
PRODUCT, cDNA CODING THEREFOR, AND ANTIBODIES THERETO 

BACKGROUND OF THE INVENTION 

This application relates to a protein designated as H1P1 which interacts with 
the Huntington's Disease gene product, cDNA coding for HIP1, and methods and 
compositions relating thereto, 
5 "Interacting proteins" are proteins which associate in vivo to form specific 

stable complexes Non-covalent bonds, including hydrogen bonds, hydrophobic interactions 
and other molecular associations form between the proteins when two protein surfaces are 
matched or have affinity for each other. This affinity or match is required for the recognition 
of the two proteins, and the formation of a stable interaction. Protein-protein interactions are 
1 0 involved in the assembly of enzyme subunits, in antigen-antibody reactions; in forming the 
supramolecular structures of ribosomes, filaments, and viruses; in transport; and in the 
interaction of receptors on a cell with growth factors and hormones. 

Huntington's disease is an adult onset disorder characterized by selective 
neuronal loss in discrete regions of the brain and spinal chord that lead to progressive 
1 5 movement disorder, personality change and intellectual decline. From onset, which generally 
occurs around age 40, the disease progresses with worsening symptoms, ending in death 
approximately 18 years after onset. 

The biochemical cause of Huntington's disease has thus far not been 
determined. Various theories have been advanced, but each has failed to stand up to 
20 experimental evidence designed to test its validity. For example, it was suggested that the 
selective neuronal loss could be attributed to restricted expression of mRNA or proteins in 
cells undergoing degeneration. No obviously altered levels of mRNA transcript or protein 
expression has ever been observed in HD-affected tissues, however. 

While the biochemical cause of Huntington's disease has remained elusive, a 
25 mutation in a gene within chromosome 4pl 6.3 subband has been identified and linked to the 
disease. This gene, referred to as the Huntington's Disease or HD gene, contains three repeat 
regions, a CAG repeat region and two CCG repeat regions. Testing of Huntington's disease 
patients has shown that the CAG region is highly polymorphic, and that the number of CAG 
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repeat units in the CAG repeat region is a very reliable diagnostic indicator of having inherited 
the gene for Huntington's disease. Thus, in control individuals and in individuals suffering 
from neuropsychiatry disorders other than Huntington's disease, the number of CAG repeats 
is between 9 and 35, while in individuals suffering from Huntington's disease the number of 
5 CAG repeats is expanded and is 36 or greater. 

The protein product encoded by the HD gene has been localized to the 
cytoplasm, including to the membranes of vesicles on the brain of both normal and HD- 
affected individuals. To date, no differences have been observed at either the total RNA, 
mRNA or protein levels between normal and HD-affected individuals. Thus, the function 
10 of the HD protein and its role in the pathogenesis of Huntington's Disease remain to be 
elucidated. 

SUMMARY OF THE INVENTION 

We have now identified a protein, designated as HIP1, that interacts 

1 5 differently with the gene product of a normal (16 CAG repeat) and an expanded ( >44 
CAG repeat) HD gene. The HIP1 protein originally isolated from the yeast two-hybrid 
screen is encoded by a 1.2 kb cDNA, devoid of stop codons, that is expressed as a 400 
amino acid polypeptide. By further screening of a human frontal cortex cDNA library, 
and employing the protocol for 5 1 Rapid Amplification of cDNA ends (RACE), a total of 

20 4795 nucleotides (with an open reading frame of 914 amino acids) of the 10 kb message 
HIP1 have been isolated to date. Expression of the HIP1 protein was found to be limited 
to the brain, where the interaction of the HIP1 with the HD protein appears to be necessary 
for the association of the HD protein with the membrane or specific cytoskeletal 
components to render it functional. Because HIP1 interacts with expanded HD protein less 

25 well than with normal length HD, introduction of additional H1P1 or overexpression of 
HIP'l can lead to increased functionality of the defective or normal HD protein. 
Alternatively, modified forms of the HIP I which bind more effectively to expanded HD 
could be introduced to convert the expanded HD protein into a functional molecule. 
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BRIEF DESCRIPTION OF THE DRAWING 

Fig. 1 graphically depicts the amount of interaction between HIP1 and 
Huntingtin proteins with varying lengths of polyglutamine repeat. 

5 DETAILED DESCRIPTION OF THE INVENTION 

The HIP I protein which interacts with the HD gene product was identified 
using the yeast two-hybrid system described in US Patent No. 5,283,173 which is 
incorporated herein by reference. Briefly, this system utilizes two chimeric genes or plasmids 
expressible in a yeast host. The yeast host is selected to contain a detectable marker gene 

1 0 having a binding site for the DNA binding domain of a transcriptional activator. The first 

chimeric gene or plasmid encodes a DNA-binding domain which recognizes the binding site of 
the selectable marker gene and a test protein or protein fragment. The second chimeric gene 
or plasmid encodes for a second test protein and a transcriptional activation domain. The two 
chimeric genes or plasmids are introduced into the host cell and expressed, and the cells are 

1 5 cultivated. Expression of the detectable marker gene only occurs when the gene product of 
the first chimeric gene or plasmid binds to the DNA binding domain of the detectable marker 
gene, and a transcriptional activation domain is brought into sufficient proximity to the DNA- 
binding domain, an occurrence which is facilitated by protein-protein interactions between the 
first and second test proteins. By selecting for cells expressing the detectable marker gene, 

20 those cells which contain chimeric genes or plasmids for interacting proteins can be identified, 
and the gene can be recovered and identified. 

In testing for Huntington Interacting Proteins, several different plasmids were 
prepared containing portions of the HD gene. The first four, identified as 16PGBT9, 
44pGBT9, 80pGBT9 and 128pGBT9, were GAL4 DNA binding domain-HD in-frame 

25 fusions containing nucleotides 3 1 4 to 1955 (amino acids ] -540) of the published HD cDNA 
sequences cloned into the vector pGBT9 (Clontech). These plasmids contain a CAG repeat 
region of 16, 44, 80 and 128 glutamine-encoding repeats, respectively. A clone (DMK 
BamHlpGBT9) was made by fusing acDNA encoding the first 544 amino acids of the 
myotonic dystrophy gene (a gift from R. Komeluk) in-frame with the GAL4-DNA BD of 

30 pGBT9 and was used as a negative control. 
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These plasmids have been used to identify and characterize HIP1, two 
additional HD-interacting proteins, HIP2 and HIP3 proteins, and can be farther used for the 
identification of additional interacting proteins, and for tests to refine the region on the protein 
in which the interaction occurs. Thus, a first aspect of the invention is these four plasmids, 
5 and the use of this plasmids in identifying HD-interacting proteins. Furthermore, it will be 
appreciated that the GAL4 DNA-binding and activating domains are not the only domains 
which can be used in the yeast two-hybrid assay. Thus, in a broader sense, the invention 
encompasses any chimeric genes or plasmids containing nucleotides 314 to 1955 of the HD 
gene together with an activating or DNA-binding domain suitable for use in the yeast one, 

10 two- or three-hybrid assay for proteins critical in either binding to the HD protein or 
responsible for regulated expression of the HD gene. 

After introducing the plasmids into Y190 yeast host cells, transforming the host 
cells with an adult human brain Matchmaker 1 '' 1 (Clontech) cDNA library coupled with a GAL4 
activating domain, and selecting for the expression of two detectable marker genes to identify 

1 5 clones containing genes for interacting proteins, the activating domain plasmids were 

recovered and analyzed. As a result of this analysis, three different cDNA fragments were 
identified as encoding for HD-interacting proteins and designated as HIP1, HIP2 and HIP3. 
The sequences of HIP 1 and HIP3 are given in Seq. ID Nos 1 and 3. The polypeptides which 
each encodes are given by Seq. ID Nos. 2 and 4. Further investigation of the HIP I cDNA 

20 resulted in the characterization of an additional region of cDNA totaling 4795 bases and a 

corresponding protein, the sequences of which are given by Seq ID Nos 5 and 6, respectively 

The cDNA molecules, particularly those encoding portions of HIP 1, can be 
explored using oligonucleotide probes for example for amplification and sequencing. In 
addition, oligonucleotide probes complementary to the cDNA can be used as diagnostic 

25 probes to localize and quantify the presence of HIP1 DNA Probes of this type with a one or 
two base mismatch can also be used in site-directed mutagenesis to introduce variations into 
the HIP1 sequence which may increase. Thus, a fijrther aspect of the present invention is an 
oligonucleotide probe, preferably having a length of from 15-40 bases which specifically and 
selectively hybridizes with the cDNA given by Seq. ID No. 1 or 5 or a sequence complemen- 

30 tary thereto As used herein, the phrase "specifically and selectively hybridizes with" the 
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cDNA refers to primers which will hybridize with the cDNA under stringent hybridization 
conditions 

DNA sequencing of the HIP1 cDNA initially isolated from the yeast two-hybrid 
screen revealed a 1 .2 kb cDN A that shows no significant degree of nucleic acid identity with 

5 any stretch of DNA using the blastn program at ncbi (blast@ncbi.nlm.nih.gov). When the 

entire HIP 1 cDNA sequence (SEQ ID NO. 5) is translated into a polypeptide, the entire HIP1 
cDNA coding (nucleotides 328-3069) is observed to be devoid of stop codons, and to produce 
a 914 amino acid polypeptide. A polypeptide identity search revealed an identity match over 
the entire length of the protein (46% conservation) with that of a hypothetical protein from C. 

10 elegam (ZK370.3 protein, C elegans cosmid ZK370). This C. elegans protein shares iden- 
tity with the mouse talin gene, which encodes a 217 kDa protein implicated with maintain- 
ing integrity of the cytoskeleton. It also shares identity with the SLA2/MOP2/ END4 gene 
from Saccharomyces cerevisiae, which is known to code for an essential cytoskeletal 
associated gene required for the accumulation and or maintenance of plasma membrane H + - 

1 5 ATPase on the cell surface. When pairwise comparisons are performed between HIP1 and 
the C. elegans ZK370.3 protein (Genpept accession number ceizk370.3), it shows 26% 
complete identity and an overall 46% level of conservation. Comparative analysis between 
H1P1 and SLA2/MOP2/ END4 (EM BL accession number Z2281 1) demonstrate similar 
conservation (20% identity, 40% conservation). 

20 HIP2 is a 2.0 kb cDNA that encodes all but the 5'-most 33 amino acids of 

human E2 25k ubiquitin conjugating enzyme. The resulting peptide has 100% identity with 
the previously characterized bovine £2^ protein. The cDNA has 95% nucleotide identity 
with the bovine cDNA. Ubiquitin-conjugating enzyme is an important component in 
ubiquitin-mediated protein degradation pathways. 

25 No difference in the strength of the interaction between HIP2 and HD 

constructs containing either 44 or 15 CAG repeats is detected using a quantitative P- 
galactosidase assay. The expression pattern of HIP2 (E2 25k ) in the various parts of the 
brain and nervous system appears to follow the specific neuropathology observed in HD, 
although there does not appear to be any difference in expression levels between HD- 

30 affected and HD-non-affected individuals. 
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The third cDNA encoding an HD-interacting protein is a 537 bp cDNA 
coding for 187 amino acids. A search of known DNA databases did not identify the 
sequence homology with any known genes. However, when a protein search was per- 
formed using the blatsp server, a strong identity between HIP3 and ankyrin-related proteins 
5 was observed. The strongest identity was with the D2021.8 gene product of C elegans, an 
uncharacterized gene, but there is also a 41% identity with AKR1, a yeast ankyrin repeat- 
containing protein. Furthermore, when analogous structures with charge conservation over 
the same amino acid stretch are considered, there is 70% protein identity. HIP3 also shares 
approximately 60% amino acid conservation with human brain specific ankyrins (ankyrin B 

10 and ankyrin C). Thus, it is reasonable to conclude that H1P3, like known ankyrins, is a 
cytoskeletal protein, and may be involved, like previously characterized ankyrins in 
promoting interactions between the membrane skeleton and other membrane proteins. 

Further exploration of these three HD interacting proteins revealed several 
important facts about HIP1 that implicate it in a significantly in the pathogenesis of 

15 Huntington's Disease. First, as shown in Fig, I, it was found that the strength of the 
interaction between HD protein and HIP1 is dependent on the number of CAG repeats. 
Second, it was found that expression of the HIP1 protein is not ubiquitous, but is limited to 
brain tissue. The highest amounts of expression are in the cortex, with lower levels being 
seen in the cerebellum, caudate and putamen. 

20 Both HIP1 and HIP3 appear to be proteins which are involved in the 

maintaining the structural integrity of the cytoskeleton and various components of the 
cellular membrane, including microtubules and focal adhesions. Based upon this, the HD 
protein may be associated as part of the cytoskeletal matrix in cells where it is expressed, 
and our work supports the conclusion that binding of H1P1 to the HD protein is necessary 

25 for the functional incorporation of the HD protein into the cell membrane. In this 

circumstance, the larger polyglutamine tract in huntingtin has a decreased ability for an 
HIP1-HD interaction. This decreased affinity for each other disrupts the normally strong 
HD-HIP1 -cytoskeletal anchoring association. Further, the HIP1-HD interaction may be a 
critical interaction at the membranes of synaptic vesicles and a decrease in the affinity of 

30 HIP1 for huntingtin may affect protein trafficking or membrane organization throughout 
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the neuron. Finally, we have demonstrated that HIP1 and HD are both found in the Triton 
X-100 insoluble membrane compartment of the cell, therefore, a decreased interaction 
between HIP1 and huntingtin may allow an abnormally subtle amount of huntingtin to be 
found in subcellular compartments in which it is normally found. 

5 Asa result of all three of these phenomenon, increased apoptosis can occur 

in specific neurons within the striatum. This increase in apoptosis arises from an increased 
susceptibility of polyglutamine-expanded huntingtin to cleavage by apopain, and because 
more of the expanded forms of the HD protein may be available for cleavage (and 
subsequent apoptosis) due to the fact they are not as tightly associated at the HD-HIP1- 

10 cytoskeletal complex. 

This understanding of a biochemical basis for the pathogenesis of 
Huntington's Disease opens the doorway to a therapeutic method to ameliorate the 
pathology in patients expressing huntingtin protein with expanded polyglutamine tracts. In 
accordance with the method, the patient is treated to increase the amount of HIP1 or an 

15 equivalent polypeptide which interacts less well with expanded Huntingtin than with 
Huntingtin having a CAG repeat region containing 15 to 35 repeats and facilitates the 
incorporation of Huntingtin into brain cell membranes. 

Increasing expression of HIP1 or an equivalent polypeptide can be 
accomplished using gene therapy approaches. In general, this will involve introduction of 

20 DNA encoding HIPI in an expressable vector into the brain cells. Vectors which have 
been shown to be suitable expression systems in mammalian cells include the herpes 
simplex viral based vectors: pHSVl (Gelleretal. Proc. Natl. Acad. Sci 87:8950-8954 
(1990)); recombinant retroviral vectors: MFG (Jaffee et al. Cancer Res. 53:2221-2226 
(1993)); Moloney-based retroviral vectors: LN, LNSX, LNCX, LXSN (Miller and 

25 Rosman Bio^echniques 7:980-989 (1989)); vaccinia viral vector: MVA (Sutter and Moss 
Proc. Natl. Acad. Sci. 89:10847-10851 (1992)); recombinant adenovirus vectors : pJM17 
(Ali et al Gene Therapy 1:367-384 (1994)), (Berkner K. L. Biotechniques 6:616-624 
1988): second generation adenovirus vector: DE1/DE4 adenoviral vectors (Wang and 
Finer Nature Medicine 2:714-716 (1996) ); and Adeno-associated viral vectors: 

30 AAV/Neo (Muro-Cacho et al. J. Immunotherapy 11:231-237 (1992)). 
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Delivery of retroviral vectors to brain and nervous system tissue has been 
described in US Patents Nos. 4,866,042, 5,082,670 and 5,529,774, which are incorporated 
herein by references. These patents disclose the use of cerebral grafts or implants as one 
mechanism for introducing vectors bearing therapeutic gene sequences into the brain, as 
5 well as an approach in which the vectors are transmitted across the blood brain barrier. 
In addition to increasing the amount of HIP1 present in brain cells of 
affected individuals, HD lethal phenotype may be rescued by coexpression of a HIP1 and 
normal sized HD protein within the same cell, specifically within neurons. The over- 
expression of the normal HD protein and the presence of excess HIP1 in the cell may be 

10 able to override the damaging effects of a decreased interaction between HIP1 and an 

expanded form of the HD protein. Therefore, a "normal state" of interaction of HD with 
HIP1 will rescue the cell from premature apoptotic death. Thus, a therapeutically desirable 
mammalian expression vector may include both a region encoding H1P1 and a region 
encoding normal (less than 35 repeats) HD protein. 

1 5 To further illustrate the methods of making the materials which are the 

subject of this invention, and the testing which has established their utility, the following 
non-limiting experimental procedures are provided. 

EXAMPLE 1 

20 IDENTIFICATION OF INTERACTING PROTEINS 

GAL4-HD cDNA constructs 

An HD cDNA construct (44pGBT9), with 44 CAG repeats was generated 
encompassing amino acids 1 - 540 of the published HD cDNA . This cDNA fragment was 
25 fused in frame to the GAL4 DNA-binding domain (BD) of the yeast two-hybrid vector 
pGBT9 (Clontech). Other HD cDNA constructs, 16pGBT9, 80pGBT9and 128pGBT9 
were constructed, identical to44pGBT9 but included only 16, 80 or 128 CAG repeats, 
respectively. 

Another clone (DMKDBamHlpGBT9) containing the first 544 amino acids 
30 of the myotonic dystrophy gene (a gift from R. Korneluk) was fused in-frame with the 
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GAL4-DNA 3D of pGBT9 and was used as a negative control. Plasmids expressing the 
GAL4-BDRAD7 (D, Gietz, unpublished) and SIR3 were used as a positive control for the 
P-galactosidase filter assay. 

The clones IT15-23Q, IT15-44Q and HAP1 were generous gifts from Dr. C. 
5 Ross. These clones represent a previously isolated huntingtin interacting protein that has a 
higher affinity for the expanded form of the HD protein. 

Yeast strains, transformations and p-gal actosidase assays 

The yeast strain YI90 (MATa Ieu2-3,1 12, ura3-52, trpl-901, his3-A200, 
10 ade2-101, gal4Agal80A, URA3;:GAL-lacZ, LYS2::GAL-HIS3,cyc r ) was used for all 

transformations and assays. Yeast transformations were performed using a modified lithium 
acetate transformation protocol and grown at 30 C using appropriate synthetic complete (SC) 
dropout media 

The P-galactosidase chromogenic filter assays were performed by transfer- 
1 5 ring the yeast colonies onto Whatman filters. The yeast cells were lysed by submerging the 
filters in liquid nitrogen for 15-20 seconds. Filters were allowed to dry at room tempera- 
ture for at least five minutes and placed onto filter paper presoaked in Z-buffer (100 mM 
sodium phosphate (pH7.0) 10 mM KG, 1 mM MgS0 4 ) supplemented with 50 mM 
2-mercaptoethanol and 0.07 mg/ml 5-bromo-4-chloro-3-indolyl P-D-galactoside (X-gal). 
20 Filters were placed at 37 C for up to 8 hours. 

Yeast two-hybrid screening for huntingtin interacting protein (HIP) 

cDNAs from an human adult brain Matchmaker™ cDNA library (Clontech) 
was transformed into the yeast strain Y190 already harboring the 44pGBT9 construct. The 

25 transformants were plated onto one hundred 1 50 mm x 1 5 mm circular culture dishes 

containing SC media deficient in Trp, Leu and His. The herbicide 3-amino-triazole (3 -AT) 
(25mM) was utilized to limit the number of false His+ positives (3 1) The yeast transformants 
were placed at 30 C for 5 days and p-galactosidase filter assays were performed on all 
colonies found after this time, as described above, to identify [}-gaiactosidase+ clones. 

30 Primary His+/p~galactosidase+ clones were then orderly patched onto a grid on SC 
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-Trp/-Leu/-His (25 mM 3 AT) plates and assayed again for His+ growth and the ability to turn 
blue with a filter assay. Secondary positives were identified for further analysis. Proteins 
encoded by positive cDNAs were designated as HIPs (Huntingtin Interactive Proteins). 
Approximately 4.0 x 10 7 Trp/Leu auxotrophic transformants were screened and of 14 clones 
5 isolated 12 represented the same cDNA (HIP1), and the other 2 cDNAs, HIP2 and HIP3 
were each represented only once. 

The HIP cDNA plasmids were isolated by growing the His + /p- 
ga!actosidase + colony in SC -Leu media overnight, lysing the cells with acid-washed glass 
beads and electroporating the bacterial strain, KC8 (leuB auxotrophic) with the yeast lysate. 
10 The KC8 ampicillin resistant colonies were replica plated onto M9 (-Leu) plates. The 

plasmid DNA from M9+ colonies was transformed into DH5-a for further manipulation. 



CONFIRMATION OF INTERACTIONS 

1 5 The HIP 1-GAL4-AD cDNA activated both the lac-Z and His reporter genes in 

the yeast strain Y190 only when co-transformed with the GAL4-BD-HD construct, but not 
the negative controls (Figure 1) of the vector alone or a random fusion protein of the 
myotonin kinase gene. In order to assess the influence of the polyglutamine tract on the 
interaction between HIP1 and HD, semi-quantitative P-galactosidase assays were 

20 performed. GAL4-BD-HD fusion proteins with 16, 44 , 80 and 128 glutamine repeats were 
assayed for their strength of interaction with the GAL4-AD-HIP1 fusion protein. 

Liquid p-galactosidase assays were performed by inoculating a single yeast 
colony into appropriate synthetic complete (SC) dropout media and grown to OD600 
0.6-1.5. Five milli litres of overnight culture was pelleted and washed once with 1 ml of 

25 Z-Buffer, then resuspended in 100 ml Z-Buffer supplemented with 38 mM 2-mercapto- 
ethanol, and 0.05% SDS. Acid washed glass beads ('100 ml) were added to each sample 
and vortexed for four minutes, by repeatedly alternating a 30 seconds vortex, with 30 
seconds on ice. Each sample was pelleted and 10 ml of lysate was added to 500 ml of lysis 
buffer. The samples were incubated in a 30 C waterbath for 30 seconds and then 100 ml of 

30 a 4 mg/ml o-nitrophenyl b-D galactopyranoside (ONPG) solution was added to each tube. 
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The reaction was allowed to continue for 20 minutes at 30 C and stopped by the addition of 
500 ml of 1 M Na 2 C0 3 and placing the samples on ice. Subsequently, OD420 was taken in 
order to calculate the P-galactosidase activity with the equation 1000 x OD420/(t x V x 
OD600) where t is the elapsed time (minutes) and V is the amount of lysate used. 
5 The specificity of the HIP1-HD interaction can be observed using the 

chromogenic filter assay. Only yeast cells harboring HIP1 and HD activate both the HIS 
and lacZ reporter genes in the Y190 yeast host. The cells that contain the HIP1 with HD 
constructs with 80 or 128 CAG repeats turn blue approximately 45 minutes after the cells 
with the smaller sized repeats (16 or 44). 
10 No difference in the p-galactosidase activity was observed between the 16 

and 44 repeats or between the 80 and 128 repeats. However, a significant difference 
(p<0.05) in activity is seen between the smaller repeats (16 and 44) and the larger repeats 
(80 and 128). (Figure 1) 

15 EXAMPLE 3 

DNA SEQUENCING. cDNA ISOLATION AND 5' RACE 
Oligonucleotide primers were synthesized on an ABI PCR-mate oligo- 
synthesizer. DNA sequencing was performed using an ABI 373 fluorescent automated 
DNA sequencer. The HIP cDNAs were confirmed to be in-frame with the GAL4-AD by 

20 sequencing across the AD-HIP1 cloning junction using an AD oligonucleotide (5'GAA 
GAT ACC CCA CCA AAC3'). 

Subsequently, primer walking was used to determine the remaining 
sequences, A human frontal cortex >4.0 kb cDNA library (a gift from S. Montal) was 
screened to isolate the full length HIP1 gene. Fifty nanograms of a 558 base pair Eco RI 

25 fragment from the original HIP1 cDNA was radioactively labeled with |a 32 P]-dCTP using 
nick-translation and the probe allowed to hybridized to filters containing > 105 pfu/ml of 
the cDNA library overnight at 65 C in Church buffer (see Northern blot protocol). The 
filters were washed at 65 C for 10 minutes with 1 X SSPE, 15 minutes at 65 C with 1 X 
SSPE and 0. 1 % SDS, then for thirty minutes and fifteen minutes with 1 X SSPE and 0. 1 % 

30 SDS. The filters were exposed to X-ray film (Kodak, XAR5) overnight at -70 C. Primary 
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positives were isolated and replated and subsequent secondary positives were hybridized 
and washed as for the primary screen. The resulting positive phage were converted into 
plasmid DNA by conventional methods (Stratagene) and the cDNA isolated and sequenced. 

In order to obtain the most 5' sequence of the HIP1 gene, a Rapid 
Amplification of cDNA Ends (RACE) protocol was performed according to the 
manufacturers recommendations (BRL). First strand cDNA was synthesized using the 
oligo H1P1-242R (5 1 GCT TGA CAG TGT AGT CAT AAA GGT GGC TGC AGT CC 
3'). After dCTP tailing the cDNA with terminal deoxy transferase, two rounds of 35 
cycles (94 C 1 minute; 53 C 1 minute; 72 C 2 minutes) of PCR using HIP1-R2 (5' GGA 
CAT GTC CAG GGA GTT GAA TAC 3') and an anchor primer (5' (CUA)4 GGC CAC 
GCG TCG ACT AGT ACG GGI IGG Gil GGG IIG3*) (BRL) were performed. The 
subsequent 650 base pair PCR product was cloned using the TA cloning system 
(Invitrogen) and sequenced using T3 and T7 primers. Sequences ID Nos. 1 and 5 show the 
sequence of the HIP1 cDNAs obtained, 

EXAMPLE 4 
DNA AND AMINO ACID ANALYSES 
Overlapping DNA sequence was assembled using the program MacVector 
and sent via email or Netscape to the BLAST server at N1H (http://www.ncbi.nlm.nih.gov) 
to search for sequence similarities with known DNA (blastn) or protein (tblastn) sequences. 
Amino acid alignments were performed with the program Clustalw. 

EXAMPLE 5 

FISH DETECTION SYSTEM AND IMAGE ANALYSIS 
The HIP1 cDNA isolated from the two-hybrid screen was mapped by 
fluorescent in situ hybridization (FISH) to normal human lymphocyte chromosomes 
counterstained with propidium iodide and DAPI. Biotinylated probe was detected with 
avidin-fluorescein isothiocyanate (FITC). Images of metaphase preparations were captured 
by a thermoelectrically cooled charge coupled camera (Photometries). Separate images of 
DAPI banded chromosomes and FITC targeted chromosomes were obtained. Hybridization 
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signals were acquired and merged using image analysis software and pseudo colored blue 
(DAPI) and yellow (FITC) as described and overlaid electronically. This study showed 
that HIP1 maps to a single genomic locus at 7qll.2. 

EXAMPLE 6 
NORTHERN BLOT ANALYSIS 

RNA was isolated using the single step method of homogenization in 
guanidinium isothiocyante and fractionated on a 1.0% agarose gel containing 0.6 M 
formaldehyde. The RNA was transferred to a hybond N -membrane (Amersham) and 
crossl inked with ultraviolet radiation. 

Hybridization of the Northern blot with b-actin as an internal control probe 
provided confirmation that the RNA was intact and had transferred. The 1.2 kb HIP1 
cDNA was labeled using nick translation and incorporation of a"P-dCTP. Hybridization 
of the original 1.2 kb HIP1 cDNA was carried out in Church buffer (0,5 M sodium 
phosphate buffer, pH 7.2, 2,7% sodium dodecyl sulphate, 1 mM EDTA) at 55 C over- 
night. Following hybridization, Northern blots were washed once for 10 minutes in 2.0 X 
SSPE, 0.1% SDS at room temperature and twice for 10 minutes in 0.15 X SSPE, 0.1% 
SDS. Autoradiography was carried our from one to three days using Hyperfilm 
(Amersham) film at -70 C. 

Analysis of the levels of RNA levels of HIP1 by Northern blot data revealed 
that the 10 kilo base HIP1 message is present in all tissue assessed. However, the levels of 
RNA are not uniform, with brain having highest levels of expression and peripheral tissues 
having less message. No apparent differences in RNA expression was noted between 
control samples and HD affected individuals. 

EXAMPLE 7 
TISSUE LOCALIZATION OF H1P1 
Tissue localization of HIP1 was studied using a variety of techniques as 
described below. Subcellular distribution of HIP- 1 protein in adult human and mouse 
brain Biochemical fractionation studies revealed the HIPl protein was found to be a 
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membrane-associated protein. No immunoreactivity was seen by Western blotting in 
cytosolic fractions, using the anti-HIPl-pepl polyclonal antibody, HIP1 immunoreactivity 
was observed in all membrane fractions including nuclei (PI), mitochondria and synapto- 
somes (P2), microsomes and plasma membranes (P3). The P3 fraction contained the most 
5 HJP1 compared to other membrane fractions. HIP1 could be removed from membranes by 
high salt (0.5M NaCI) buffers indicating it is not an integral membrane protein, however, 
since low salt (0.1- 0.25M NaCI) was only able to partially remove HIP1 from membranes, 
its membrane association is relatively strong. The extraction of P3 membranes with the 
non-ionic detergent, Triton X-100 revealed HIP1 to be a Triton X-100 insoluble protein. 

10 This characteristic is shared by many cytoskeletal and cytoskeletal-associated membrane 
proteins including actin, which was used as a control in this study. The biochemical 
characteristics of HIP1 described were found to be identical in mouse and human brain and 
was the same for both forms of the protein (both bands of the HIP1 doublet). H1P1 
co-localized with huntingtin in the P2 and P3 membrane fractions, including the high-salt 

15 membrane extractions, as well as in the Triton X-tOO insoluble residue. The subcellular 
distribution of HIP1 was unaffected by the expression of polyglutamine-expanded 
huntingtin in transgenic mice and HD patient brain samples. 

The localization of HIP1 protein was further investigated by immunohisto- 
chemistry in normal adult mouse brain tissue. Immunoreactivity was seen in a patchy, 

20 reticular pattern in the cytoplasm, appeared excluded from the nucleus and stained most 

intensely in a discontinuous pattern at the membrane. These results are consistent with the 
association of HIP! with the cytoskeletal matrix and further indicate an enrichment of HIP1 
at plasma membranes. Immunoreactivity occurred in all regions of the brain, including 
cortex, striatum, cerebellum and brainstem, but appeared most strongly in neurons and 

25 especially in cortical neurons. As described previously, huntingtin immunoreactivity was 
seen exclusively and uniformly in the cytosol. 

The in situ hybridization studies showed HIP1 mRNA to be ubiquitously and 
generally expressed throughout the brain. This data is consistent with the immunohisto- 
chemical results and was identical to the distribution pattern of huntingtin mRNA in 

30 transgenic mouse brains expressing full-length human huntingtin. 
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Prntein Preparation And Western Blotting F or Expression Studies 

Frozen human tissues were homogenized using a Polytron in a buffer 
containing 0.25M sucrose, 20mM Tris-HCI (pH 7.5), lOmM EGTA, 2mM EDTA 
supplemented with lOug/ml of leupeptin, soybean trypsin inhibitor and ImM PMSF, then 

5 centrifuged at 4,000rpm for 10* at 4 C to remove cellular debris. 100-150ug/lane of protein 
was separated on 8% SDS-PAGE mini-gels and then transferred to PVDF membranes. 
Huntingtin and HIP1 were electroblotted overnight in Towbin's transfer buffer (25 mM 
Tris-HCI, 0.192M glycine, pH8.3, 10% methanol) at 30V onto PVDF membranes 
(ImmobiloivP, Millipore) as described (Towbin et al, Proc. Nat 'I Acad. Sci.(USA) 76: 

10 4350-4354 (1979)). Membranes were blocked for 1 hour at room temperature in 5% skim 
milk/ TBS (lOmM Tris-HCI, 0.15M NaCI, pH7.5). Antibodies against huntingtin (pAb 
BKP1, 1:500), actin (mAb A-4700, Sigma, 1:500) or HIP1 (pAb HIP-pepI, 1:200) were 
added to blocking solution for 1 hour at room temperature. After 3x10 minutes washes 
in TBS-T (0.05% Tween-20/TBS), secondary Ab (horseradish peroxidase conjugated IgG, 

15 Biorad) was applied in blocking solution for 1 hour at room temperature. Membranes were 
washed and then incubated in chemiluminescent ECL solution and visualized using 
Hyperfilm-ECL film (Amersham). 

(feneration of Antibodies 

20 The generation of huntingtin specific antibodies GHM1 and BKP1 is des- 

cribed elsewhere (Kalchman, et ah, /. Biol. Chem. 271: 19385-19394 (1996)). The HIP1 
peptide (VLEKDDLMDMDASQQN, a.a. 76-91 of Seq. ID No. 2) was synthesized with 
Cys on the N-terminus for the coupling, and coupled to Keyhole limpet hemocyanin 
(KLH) (Pierce) with succinimidyl 4-(N-maleimidomethyl) cyclohexame-l-carboxylate 

25 (Pierce). Female New Zealand White rabbits were injected with HIP1 peptide-KLH and 
Freund's adjuvant. Antibodies against the H1P1 peptide were purified from rabbit sera 
using affinity column with low pH elution. Affinity column was made by incubation of 
H1P1 peptide with activated thio-Sepharose (Pharmacia). 

Western blotting of various peripheral and brain tissues were consistent with 

30 the RNA data. The HIP1 protein levels observed was not ubiquitous. The protein 
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expression is limited to brain tissue, with highest amounts seen in the cortex and lower 
levels seen in the cerebellum and caudate and putamen. 

More regio-specific analysis of HIP1 expression in the brain revealed no 
differential expression pattern in affected individuals when compared to normal controls, 
5 with highest levels of expression seen in both controls and HD patients in the cortical 
regions. 

EXAMPLE 8 

CO-IMMUNOPRECIPITAT10N OF HIP1 WITH HUNTINGTIN 

10 Confirmation of the HD-HJP1 interaction was performed using coimmuno- 

prepiiation as follows. Control human brain (frontal cortex) lysate was prepared in the 
same manner as for subcellular localization study. Prior to immunoprecipitation, tissue 
lysate was centrifuged at 5000 rpm for 2 minutes at 4 C, then the supernatant was 
pre-cleared by the incubated with excess amount of Protein A-Sepharose for 30 minutes at 

15 4 C T and centrifuged at the same condition. Fifty microlitres of supernatant (500 mg 
protein) was incubated with or without antibodies (10 ug of anti-huntingtin GHM1 
(Kalchman, et al. 1996) or anti-synaptobrevin antibody) in the total 500 ul of incubation 
buffer (20mM Tris-Cl (pH7.5), 40mM NaCl, ImM MgCI 2 ) for 1 hour at 4 C. Twenty 
microlitres of Protein A-Sepharose (1:1 suspension, for GHM1 and no antibody control) or 

20 Protein G-Sepharose (for anti-synaptobrevin antibody; Pharmacia) was added and 

incubated for 1 hour at 4 C. The beads were washed with washing buffer (incubation 
buffer containing 0.5 % Triton X-100) three times. The samples on the beads were 
separated using SDS-PAGE (7.5% acrylamide) and transferred to PVDF membrane 
(Immobilon-P, Millipore). The membrane was cut at about 150 kDa after transfer for 

25 Western blotting (as described above). The upper piece was probed with anti-huntingtin 
BKP1 (1/1000) and lower piece with anti-HIPl antibody (1/300). 

The results showed that when an anti-HIPl polyclonal antibody was 
immunoreacted against a blot containing the GHM1 immunoprecipitates from the brain 
lysate a doublet was observed at approximately 100 kDa was. When GHM1 was immuno- 

30 reacted against the same immunoprecipitate the 350 kDa HD protein was also seen The 
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specificity of the HD-HIP1 interaction is seen as no immunoreactive bands seen are as a 
result of the proteins adsorbing to the Protein-A-Sepharose (Lysate + No Antibody) or 
when a random, non related antibody (Lysate + anti-Synaptobrevin) is used as the 
immunoprecipitating antibody. 

EXAMPLE 9 

Subcellular fractionation of brain tissue 

Cortical tissue (20-100 mg/ml) was homogenized, on ice, in a 2 ml 
pyrex-teflon IKA-RW15 homogenizer (Tekmar Company) in a buffer containing 0.303M 
sucrose, 20mM Tris-HCI pH 6.9, ImM MgCW, 0.5mM EDTA, ImM PMSF, ImM 
leupeptin, soybean trypsin inhibitor and ImM benzamidine (Wood et al., Human Moke. 
Genet. 5:481-487 (1996)). 

Crude membrane vesicles were isolated by two cycles of a three-step differ- 
ential centrifugation protocol in a Beckman TLA 120.2 rotor at 4 C based on the methods 
of Wood et al (1996). The first step precipitated cellular debris and nuclei from tissue 
homogenates for 5 minutes at 1300 x g (PI). The 1300 x g supernatant was subsequently 
centrifuged for 20 minutes at 14 000 x g to isolate synaptosomes and mitochondria (P2). 
Finally, microsomal and plasma membrane vesicles were collected by a 35 minute 
centrifugation at 142 000 x g (P3). The remaining supernatant was defined as the cytosolic 
fraction. 

High salt extraction of membranes 

Aliquots of P3 membranes were twice suspended at 2mg/ ml in 0.5M NaCl, 
lOmM Tris-HCI, 2mM MgCl 2 , pH7.2, containing protease inhibitors (see above). The 
same buffer without NaCl was used as a control. The membrane suspensions were 
incubated on ice for 30 minutes and then centrifuged at 142 000 x g for 30 minutes. 

Extraction of cytoskeletal and cytoskg letal-associated proteins. 

To extract cytoskeletal proteins, crude membrane vesicles from the P3 
fraction membrane were suspended in a volume of Triton X-100 extraction buffer to give a 
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protein: detergent ratio of 5: 1 . The composition of the Triton X-100 extraction buffer was 
based on the methods of Arai et aL, J. Neuroscience 38: 348-357 (1994) and contained 
2% Triton X-100, lOrnM Tris-HCI, 2mM MgCl,, ImM leupeptin, soybean trypsin 
inhibitor, PMSF and benzamidine. Membrane pellets were suspended by hand with a 
5 round-bottom teflon pestle, and placed on ice for 40 minutes. Insoluble cytoskeletal 

matrices were precipitated for 35 minutes at 142 000 x g in a Beckman TLA 120.2 rotor. 
The supernatant was defined as non-cytoskeletal-associated membrane or membrane- 
-associated protein and was removed. The remaining pellet was extracted with Triton 
X-100 a second time using the same conditions. We defined the final pellet as cytoskeletal 
10 and cytoskeletaNassociated protein. 

Solubilization of protein and analysis hy SDS-PAGE an d Western Blpping 

Membrane and cytoskeletal protein was solubilized in a minimum volume of 
1 % SDS, 3M urea, 0. ImM dithiothreitol in TBS buffer and sonicated. Protein concen- 
1 5 tration was determined using the BioRad DC Protein assay and samples were diluted at 
least 1 X with 5 X sample buffer (250mM Tris-HCl pH 6.8, 10% SDS, 25% glycerol, 
0.02% bromophenol blue and 7% 2-mercaptoethanol) and were loaded on 7.5% 
SDS-PAGE gels (Bio-Rad Mini-PROTEIN II Cell system) without boiling. Western 
blotting was performed as described above. 

20 

Immunohistochemistry 

Brain tissue was obtained from a normal C57BL/6 adult (6 months old) male 
mouse sacrificed with chloroform then perfusion-fixed with 4% v/v paraformaldehyde/0.0 1 
M phosphate buffer (4% PFA). The brain tissues were removed, immersion fixed in 4% 

25 PFA for 1 day, washed in 0.01 M phosphate buffered saline, pH 7.2 (PBS) for 2 days, and 
then equilibrated in 25% w/v sucrose PBS for 1 week. The samples were then snap-frozen 
in Tissue Tek molds by isopentane cooled in liquid nitrogen. After warming to -20 C, 
frozen blocks derived from frontal cortex, caudate/putamen, cerebellum and brainstem 
were cut into 14 mm sections for immunohistochemistry. Following washing in PBS, the 

30 tissue sections were blocked using 2.5% v/v normal goat serum for 1 hour at room 
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temperature. Primary antibodies diluted with PBS were applied to sections overnight at 4 
C. Optimal dilutions for the polyclonal antibodies BKP1 and HIP1 were 1:50. Using 
washes of 3 x 5 minutes in PBS at room temperature, sections were sequentially incubated 
with biotinylated secondary antibody and then an avidin-biotin complex reagent (Vecta 
5 Stain ABC Kit, Vector) for 60 minutes each at room temperature. Color was developed 
using 3-3'-diaminobenzidine tetrahydrocholoride and ammonium nickel sulfate. 

For controls, sections were treated as described above except that HIP1 
antibody aliquots were preabsorbed with an excess of HIP1 peptide as well as a peptide 
unrelated to HIP1 prior to incubation with the tissue sections. 

10 

In situ hybridization 

In situ hybridization was performed as previously described with some 
modification (Suzuki et al, BBRC219: 708-713 (1996)). The RNA probes were prepared 
using the plasmid gtl49 (Lin, B., et ah, Human Moke. Genet, 2: 1541-1545 (1994)) or a 

15 558 subclone of HIP1. The anti-sense and sense single-stranded RNA probes were 
synthesized using T3 and T7 RNA polymerases and the In Vitro Transcription Kit 
(Clontech) with the addition of |ct 35 S]-CTP (Amersham) to the reaction mixture. Sense 
RNA probes were used as negative controls. For HIP1 studies normal C57BL/6 mice were 
used. Huntingtin probes were tested on two different transgenic mouse strains expressing 

20 full-length huntingtin, cDNA HDI0366(44CAG) C57BL/6 mice and YAC 

HD10366(18CAG) FVB/N mice. Frozen brain sections (lOum thick) were placed onto 
silane-coated slides under RNase-free conditions. The hybridization solution contained 
40% w/v formamide, 0.02M Tris-HCI (pH 8.0), 0.005M EDTA, 0.3 M NaCI, 0.01M 
sodium phosphate (pH 7.0), lx Denhardt's solution, 10% w/v dextran sulfate (pH 7.0), 

25 0.2% w/v sarcosyl, yeast tRNA (500mg/ml) and salmon sperm DNA (200mg/ml). The 
radiolabeled RNA probe was added to the hybridization solution to give 1 x 106 cpm/200 
ul/ section. Sections were covered with hybridization solution and incubated on 
formamide paper at 65 C for 18 hours. After hybridization, the slides were washed for 30 
minutes sequentially with 2x SSC, lx SSC and high stringency wash solution (50% 

30 formamide, 2x SSC and 0.1M dithiothreitol) at 65 C, followed by treatment with Rnase A 
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(lmg/ml) at 37 C for 30 minutes, then washed again and air-dried. The si ides were first 
exposed on autoradiographic film (b-max, Amersham, UK) for 48 hours and developed for 
4 minutes in Kodak D-19 followed by a 5 minute fixation in Fuji-fix. For longer 
exposures, the slides were dipped in autoradiographic emulsion (50% w/v in distilled 
5 water, NR-2, Konica, Japan), air-dried and exposed for 20 days at 4 C then developed as 
described. Sections were counterstained with methyl green or Giemsa solutions. 
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SF.OirF.NCF. LISTING 

(1) GENERAL INFORMATION. 

(1) APPLICANT: Kalchman, Michael 
Goldberg, Paul 

Hayden. Michael R. 

(ii) TITLE OF INVENTION: Protein Which Interacts with the Huntington's Disease Gene 
Product, cDNA Coding Therefor, and Antibodies Thereto 

(iii) NUMBER OF SEQUENCES: 8 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Oppedahl & Larson 

(B) STREET: 1992 Commerce Street Suite 309 

(C) CITY: Yorktown 

(D) STATE: NY 

(E) COUNTRY: USA 

(F) ZIP: 10598 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE. Diskette, 3.50 inch, 1.44 Kb storage 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: MS DOS 5.0 

(D) SOFTWARE: WordPerfect 

(vi) CURRENT APPLICATION DATA 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Larson, Marina T. 

(B) REGISTRATION NUMBER: 32038 

(C) REFERENCE/DOCKET NUMBER: UBC P-013 

(ix) TELECOMMUNICATION INFORMATION 

(A) TELEPHONE: (914) 245-3252 

(B) TELEFAX: (914) 962-4330 

(2) INFORMATION FOR SEQ ID NO 1 : 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 1164 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: no 

(iv) ANTI-SENSE: no 
(vi) ORIGINAL SOURCE. 
(A) ORGANISM human 

(ix) FEATURE: cDNA for Huntingtin-interacting protein 
(xi)SEQUENCE DESCRIPTION: SEQ ID NO: 1 
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ACAGCTGACA CCCTGCAAGG CCACCGGGAC CGCTTCATGG AGCAGTTTAC 50 
AAAGTTGAAA GATCTGTTCT ACCGCTCCAG CAACCTGCAG TACTTCAAGC 100 
GGGTCATTCA GATCCCCCAG CTGCCTGAGA ACCCACCCAA CTTCCTGCGA 150 
GCCTCAGCCC TGTCAGAACA TATCAGCCCT GTGGTGGTGA TCCCTGCAGA 2 00 
GGCCTCATCC CCCGACAGCG AGCCAGTCCT AGAGAAGGAT GACCTCATGG 250 
ACATGGATGC CTCTCAGCAG AATTTATTTG ACAACAAGTT TGATGACNTC 300 
TTTGGCAGTT CATCCAGCAG TGATCCCTTC AATTTCAACA GTCAAAATGG 350 
TGTGAACAAG GATGAGAAGG ACCACTTAAT TGAGCGACTA TACAGAGAGA 400 
TCAGTGGATT GAAGGCACAG CTAGAAAACA TGAAGACTGA GAGCCAGCGG 450 
GTTGTGCTGC AGCTGAAGGG CCACGTCAGC GAGCTGGAAG CAGATCTGGC 500 
CGAGCAGCAG CACCTGCGGC AGCAGGCGGC CGACGACTGT GAATTCCTGC 550 
GGG CAGAACT GGACGAGCTC AGGNGGCAGC GGGAGGACAC CGAGAAGGCT 600 
CAGCGGAGCC TGTCTGAGAT AGAAAGGAAA GCTCAAGCCA ATGAACAGCG 65 0 
ATATAGCAAG CTAAAGGAGA AGTACAGCGA GCTGGTTCAG AACCACGCTG 700 
ACCTGCTGCG GAAGAATGCA GAGGTGACCA AACAGGTGTC CATGG CCAGA 750 
CAAGCCCAGG TAGATTTGGA ACGAGAGAAA AAAGAGCTGG AGGATT CGTT 800 
GGAGCGCATC AGTGACCAGG GCCAGCGGAA GACTCAAGAA GAGCTGGAAG 85 0 
TTCTAGAGAG CTTGAAGCAG GAACTTGGCA CAAGCCAACG GGAGCTTCAG 900 
GTTCTGCAAG GCAGCCTGGA AACTTCTGCC CAGTCAGAAG CAAACTGGGC 950 
AGCCGAGTTC GCCGAGCTAG AGAAGGAGCG GGACAGCCTG GTGAGTGGCG 1000 
CAGCTCATAG GGAGGAGGAA TTATCTGCTC TTCGGAAAGA ACTGCAGGAC 1050 
ACTCAGCTCA AACTGGCCAG CACAGAGGAA TCTATGTGCC AGCTTGCCAA 1100 
AGACCAACGA AAAATGCTTC TGGTGGGGTC CAGGAAGGCT GCGGAGCAGG 1150 
TGATACAAGA CGCG 1164 



(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 386 

(B) TYPE protein 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: no 
(vi) ORIGINAL SOURCE: 
(A) ORGANISM: human 

(ix) FEATURE: Huntingtin-interacting protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0 2 

Thr Ala Asp Thr Leu Gin Gly His Arg Asp Arg Phe Met Glu Gin 
15 10 15 

Phe Thr Lys Leu Lys Asp Leu Phe Tyr Arg Ser Ser Asn Leu Gin 

20 25 30 

Tyr Phe Lys Arg Val lie Gin lie Pro Gin Leu Pro Glu Asn Pro 

35 40 45 

Pro Asn Phe Leu Arg Ala Ser Ala Leu Ser Glu His lie Ser Pro 

50 55 60 

Val Val Val He Pro Ala Glu Ala Ser Ser Pro Asp Ser Glu Pro 

65 70 75 



04/06/2004, EAST Version: 1.4.1 



WO 97/18825 



PCT/US96/18370 



- 23 - 

Val Leu Glu Lys Asp Asp Leu Met Asp Met Asp Ala Ser Gin Gin 

80 85 90 

Asn Leu Phe Asp Asn Lys Phe Asp Asp Phe Gly Ser Ser Ser Ser 

95 100 105 

Ser Asp Pro Phe Asn Phe Asn Ser Gin Asn Gly Val Asn Lys Asp 
110 115 120 

Glu Lys Asp His Leu lie Glu Arg Leu Tyr Arg Glu lie Ser Gly 
125 130 135 

Leu Lys Ala Gin Leu Glu Asn Met Lys Thr Glu Ser Gin Arg Val 
140 145 150 

Val Leu Gin Leu Lys Gly His Val Ser Glu Leu Glu Ala Asp Leu 
155 160 165 

Ala Glu Gin Gin His Leu Arg Gin Gin Ala Ala Asp Asp Cys Glu 
170 175 180 

Phe Leu Arg Ala Glu Leu Asp Glu Leu Arg Gin Arg Glu Asp Thr 
185 190 195 

Glu Lys Ala Gin Arg Ser Leu Ser Glu lie Glu Arg Lys Ala Gin 
200 205 210 

Ala Asn Glu Gin Arg Tyr Ser Lys Leu Lys Glu Lys Tyr Ser Glu 
215 220 225 

Leu Val Gin Asn His Ala Asp Leu Leu Arg Lys Asn Ala Glu Val 
230 235 240 

Thr Lys Gin Val Ser Met Ala Arg Gin Ala Gin Val Asp Leu Glu 
245 250 255 

Arg Glu Lys Lys Glu Leu Glu Asp Ser Leu Glu Arg lie Ser Asp 
260 265 270 

Gin Gly Gin Arg Lys Thr Gin Glu Gin Leu Glu Val Leu Glu Ser 
275 280 285 

Leu Lys Gin Glu Leu Gly Thr Ser Gin Arg Glu Leu Gin Val Leu 
290 295 300 

Gin Gly Ser Leu Glu Thr Ser Ala Gin Ser Glu Ala Asn Trp Ala 
305 310 315 

Ala Glu Phe Ala Glu Leu Glu Lys Glu Arg Asp Ser Leu Val Ser 
320 325 330 

Gly Ala Ala His Arg Glu Glu Glu Leu Ser Ala Leu Arg Lys Glu 
335 340 345 
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Leu Gin Asp Thr Gin Leu Lys Leu Ala Ser Thr Glu Glu Ser Met 

350 355 360 

Cys Gin Leu Ala Lys Asp Gin Arg Lys Met Leu Leu Val Gly Ser 
365 370 375 

Arg Lys Ala Ala Glu Gin Val He Gin Asp Ala 
380 385 386 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: no 

(iv) ANTJ-SENSE: no 
(vi) ORIGINAL SOURCE: 
(A) ORGANISM: human 

(ix) FEATURE: cDNA for Huntingtin-interacting protein 
(xi)SEQUENCE DESCRIPTION: SEQ ID NO:3: 

ACCGATACCG AAGCGGGCTG TGTGCCCCTT CTCCACCCAG AGGAAATCAA 50 
ACCCCAAAGC CATTATAACC ATGGATATGG TGAACCTCTT GGACGGAAAA 100 
CTCATATTGA TGATTA CAGC ACATGGGACA TAGTCAAGGC TACACAATAT 150 
GGAATATATG AACGCTGTCG AGAATTGGTG GAAGCAGGTT ATGATGTACG 200 
GCAACCGGAC AAAGAAAATG TTACCCTCCT CCATTGGGCT GCCATCAATA 250 
ACAGAATAGA TTTAGTCAAA TACTATATTT CGAAAGGTGC TATTGTGGAT 300 
CAACTTGGAG GGGACCTGAA TTCAACTCCA TTGCACTGGG ACACAAGACA 350 
AGGCCATCTA TCCATGGTTG TGCAACTAAT GAAATATGGT GCAGATCCTT 400 
CATTAATTGA TGGAGAAGGA TGTAGCTGTA TTCATCTGGC TGCTCAGTTC 450 
GGACATACCT CAATTGTTGC TTATCTCATA GCAAAAGGAC AGGATGTG 498 



(2) INFORMATION FOR SEQ ID N0 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 

(B) TYPE, protein 

(D) TOPOLOGY linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: no 
(vi) ORIGINAL SOURCE. 
(A) ORGANISM: human 

(ix) FEATURE: Huntingtin-interacting protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID N04: 
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Thr Asp Thr Glu Ala Gly Cys Val Pro Leu Leu His Pro Glu Glu 
15 10 15 

He Lys Pro Gin Ser His Tyr Asn His Gly Tyr Gly Glu Pro Leu 
20 25 30 

Gly Arg Lys Thr His He Asp Asp Tyr Ser Thr Trp Asp He Val 

35 40 45 

Lys Ala Thr Gin Tyr Gly He Tyr Glu Arg Cys Arg Glu Leu Val 

50 55 60 

Glu Ala Gly Tyr Asp Val Arg Gin Pro Asp Lys Glu Asn Val Thr 

65 70 75 

Leu Leu His Trp Ala Ala He Asn Asn Arg He Asp Leu Val Lys 

80 85 90 

Tvr Tvr He Ser Lys Gly Ala He Val Asp Gin Leu Gly Gly Asp 

95 100 105 

Leu Asn Ser Thr Pro Leu His Trp Asp Thr Arg Gin Gly His Leu 
no H5 120 

Ser Met Val Val Gin Leu Met Lys Tyr Gly Ala Asp Pro Ser Leu 
125 130 135 

He Asp Gly Glu Gly Cys Ser Cys He His Leu Ala Ala Gin Phe 
140 145 150 



Gly His Thr Ser 
154 



(2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4846 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: no 

(iv) ANTI-SENSE: no 
(vi) ORIGINAL SOURCE: 
(A) ORGANISM: human 

(ix) FEATURE: cDNA for Huntingtin-interacting protein 
(xi)SEQUENCE DESCRIPTION: SEQ ID N0 5: 

CAGTGTACGG TTGATCATAT AACGCCGCGG GCGGGGATTG GTTTATATAT 50 
CGCAAATTGA TNTAGGGGGG GGGGGATGGN CAGAGATTTC GCTTCATTAG 100 
GCCATTATAA GCAGGAAGGG TTTCAAGGAA AAAAACCCAG AAAGTGCATA 150 
TTGCACCCAC CATGAGAAAG GGGCAACAGA CCTTNTGTTN TGTTNTCAAC 200 
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CGCCTGCTTC TGTTTTAGCA ACGCAGTGTT TTGGTGGAAG TTGTGCCATG 250 

TGTTCCACAA ANTCTTCCGA GATGGACACC CGAACGTCCT GAAGGACTTT 3 00 

GTGAGATACA GAAATGAATT GAGTGACATG AGCAGGATGT GGGGCCACCT 3 50 

GAGCGAGGGG TATGGCCAGC TGTGCAGCAT CTACCTGAAA CTGCTAAGAA 4 00 

CCAAGATGGA GTACCACACC AAAAATCCCA GGTTCCCAGG CAACCTGCAG 450 

ATGAGTGACC GCCAGCTGGA CGAGGCTGGA GAAAGTGACG TGAACAACTT 500 

TTT CCAGTTA ACAGTGGAGA TGTTTGACTA CCTGGAGTGT GAACTCAACC 550 

TCTTCCAAAC AGTATTCAAC TCCCTGGACA TGTCCCGCTC TGTGTCCGTG 6 00 

ACGGCAGCAG GGCAGTGCCG CCTCGCCCCG CTGATCCAGG TCATCTTGGA 650 

CTGCAGCCAC CTTTATGACT ACACTGTCAA GCTTCTCTTC AAACTCCACT 700 

CCTGCCTCCC AGCTGACACC CTGCAAGGCC ACCGGGACCG CTTCATGGAG 750 

CAGTTTACAA AGTTGAAAGA TCTGTTCTAC CGCTCCAGCA ACCTGCAGTA 800 

CTTCAAGCGG CTCATTCAGA TCCCCCAGCT GCCTGAGAAC CCACCCAACT 850 

TCCTGCGAGC CTCAGCCCTG TCAGAACATA TCAGCCCTGT GGTGGTGATC 900 

CCTGCAGAGG CCTCATCCCC CGACAGCGAG CCAGTCCTAG AGAAGGATGA 950 

CCTCATGGAC ATGGATGCCT CTCAGCAGAA TTT ATTTGA C AACAAGTTTG 1000 

ATGACATCTT TGGCAGTTCA TTCAGCAGTG ATCCCTTCAA TTTCAACAGT 1050 

CAAAATGGTG TGAACAAGGA TGAGAAGGAC CACTTAATTG AGCGACTATA 1100 

CAGAGAGATC AGTGGATTGA AGGCACAGCT AGAAAACATG AAGACTGAGA 1150 

GCCAGCGGGT TGTGCTGCAG CTGAAGGGCC ACGTCAGCGA GCTGGAAGCA 1200 

GATCTGGCCG AGCAGCAGCA CCTGCGGCAG CAGGCGGCCG ACGACTGTGA 1250 

ATTCCTGCGG GCAGAACTGG ACGAGCTCAG GAGGCAGCGG GAGGACACCG 1300 

AGAAGGCTCA GCGGAGCCTG TCTGAGATAG AAAGGAAAGC TCAAGCCAAT 1350 

GAACAGCGAT ATAGCAAGCT AAAGGAGAAG TACAGCGAGC TGGTTCAGAA 14 00 

CCACGCTGAC CTGCTGCGGA AGAATGCAGA GGTGACCAAA CAGGTGTCCA 145 0 

TGGCCAGACA AGCCCAGGTA GATTTGGAAC GAGAGAAAAA AGAGCTGGAG 1500 

GATTCGTTGG AGCGCATCAG TGACCAGGGC CAGCGGAAGA CTCAAGAACA 1550 

GCTGGAAGTT CTAGAGAGCT TGAAGCAGGA ACTTGGCACA AGCCAACGGG 1600 

AGCTTCAGGT TCTGCAAGGC AGCCTGGAAA CTTCTGCCCA GTCAGAAGCA 165 0 

AACTGGGCAG CCGAGTTCGC CGAGCTAGAG AAGGAGCGGG ACAGCCTGGT 1700 

GAGTGGCGCA GCT CATAGGG AGGAGGAATT ATCTGCTCTT CGGAAAGAAC 1750 

TGCAGGACAC TCAGCTCAAA CTGGCCAGCA CAGAGGAATC TATGTGCCAG 180 0 

CTTGCCAAAG ACCAACGAAA AATGCTTCTG GTGGGGTCCA GGAAGGCTGC 1850 

GGAGCAGGTG ATACAAGACG CCCTGAACCA GCTTGAAGAA CCTCCTCTCA 1900 

TCAGCTGCGC TGGGTCTGCA GATCACCTCC TCTCCACGGT CACATCCATT 1950 

TCCAGCTGCA TCGAGCAACT GGAGAAAAGC TGGAGCCAGT ATCTGGCCTG 2 000 

CCCAGAAGAC ATCAGTGGAC TTCTCCATTC CATAACCCTG CTGGCCCACT 2 05 0 

TGACCAGCGA CGCCATTGCT CATGGTGCCA CCACCTGCCT CAGAGCCCCA 2100 

CCTGAGCCTG CCGACTCACT GACCGAGGCC TGTAAG CAGT ATGGCAGGGA 215 0 

AACCCTCGCC TACCTGGCCT CCCTGGAGGA AGAGGGAAGC CTTGAGAATG 2200 

CCGACAGCAC AGCCATGAGG AACTGCCTGA GCAAGATCAA GGCCATCGGC 2250 

GAGGAGCTCC TGCCCAGGGG ACTGGACATC AAGCAGGAGG AGCTGGGGGA 2300 

CCTGGTGGAC AAGGAGATGG CGGCCACTTC AGCTGCTATT GAAACTTGCA 2 350 

CGGCCAGAAT AGAGGAGATG CTCAGCAAAT CCCGAGCAQG AGACACAGGA 2400 

GTCAAATTGG AGGTGAATGA AAGGATCCTT CGTTGCTGTA CCAGCCTCAT 2450 

GCAAGCTATT CAGGTGCTCA TCGTGGCCTC TAAGGACCTC CAGAGAGAGA 2 500 

TTGTGGAGAG CGGCAGGGGT ACAGCATCCC CTAAAGAGTT TTATGCCAAG 2 550 

AACTCTCGAT GGACAGAAGG ACTTATCTCA GCCTCCAAGG CTGTGGGCTG 260 0 

GGGAGCCACT GTCATGGTGG ATGCAGCTGA TCTGGTGGTA CAAGGCAGAG 265 0 

GGAAATTTGA GGAGCTAATG GTGTGTTCTC ATGAAATTGC TGCTAGCACA 2 700 

GCCCAGCTTG TGGCTGCATC CAAGGTGAAA GCTGATAAGG ACAGCCCCAA 2 75 0 

CCTAGCCCAG CTGCAGCAGG CCTCTCGGGG AGTGAAC CAG GCCACTGCCG 2800 

GCGTTGTGGC CTCAACCATT TCCGGCAAAT CACAGATCGA AGAGACAGAC 2 850 

AACATGGACT TCTCAAGCAT GACGCTGACA CAGATCAAAC GCCAAGAGAT 290 0 



04/06/2004, EAST Version: 1.4.1 



WO 97/18825 



PCT/US96/18370 



- 27 - 



GGATTCTCAG 

GTCAAAAACT 

GCTGAGGGCT 

AGTGGTAACC 

TAAATCCTTG 

AATCCTTGGA 

AGGACATGCA 

GTTTGGACCC 

CCAGGCTGGC 

GTGGGGGGCA 

CTATTTGACC 

GTTTGTTGAC 

TTCTTCTCAG 

GAGGGCTGAT 

AGAAGGACGG 

CTCTTTGGGC 
GGTTTTTTGG 

CTCCCAAAGG 
GGTAGCTCCA 
CTCCACACTG 
CGGAGCTGCT 
CCTTTCCCTC 
CAAGGGAGAA 
TCCCGTGACG 
AGATCAACAA 
CTGTCAAGCG 
TCCCAACTCC 
GATCGTTCCA 
TGCTCCCATC 
ATTACTAACC 
AATCAGAACT 
ACATCAGCCT 
ACTTGGATCC 
ATGAGAAAGG 
AGGCTCTCGC 
CTTGCCAGGG 
CCAGCATTTA 
TATCTATAGC 



GTTAGGGTGC 
GGGAGAGCTT 
GGGAAGAAGG 
GAAAAAGAAT 
TTACCTATCT 
GTCCCAGGGG 
TGACACTTCC 
ATGGTCATCT 
CAGTGCTGCC 
GGGCCACTCA 
CCCACAACAA 
AGCTTGGAAA 
TCTTTTCAGT 
GGATTCCAAA 
CAGGAGTGTC 
AGTGCCATGG 
TTTTGTTTTT 
GCACACCCCT 
GCGATGGTGC 
ACCAAGTGCT 
GAGTGACAGC 
CTAAAGCTGA 
GACAACAGAA 
AGGCTCAAAA 
CACTACTTCC 
GGCCGTCTCC 
TTTCTGCAGA 
CTTTCTACGC 
AGGGAAGAAC 
TCCCTTAAGC 
TCAGGTGTGA 
TCAAGAATCA 
CAAAGCAAGG 
ACAGAGCCAG 
TGCCCTGTGG 
ATGGGCAGCC 
AGTGACCTTC 
AACTCATTGG 



TAGAGCTAGA 
CGGAAAAAGC 
AACAGAGGCA 
AGAGCCAAAC 
CGTGTGTGTT 
CAGCCACACC 
CAAAGATCCC 
CTGTTCTTTT 
CATGAGCAAG 
ACAGAGAGGA 
TGGGTATCCT 
GGGAAGATCT 
TTCATCATTT 
CCAGGACACT 
CTGGCTGTGA 
ATTTCCACTG 
TTTTTTTAAG 
GGGGCTGAGT 
TGCCCAGGCC 
GGCCCACCCA 
TTTCCTCAAA 
ATCCCGGCGG 
AGAGGGACAA 
ACTTGATCAC 
CTGCCGGAAT 
CTTGGCCCAG 
CGTCTGCCTT 
AATTGACAAA 
CCTATACTTG 
AGCAACAGCC 
CTCTAGCAAA 
GAAGAAAGCC 
AGATCATTTG 
CGGCTCCAAC 
ACAGGATGAG 
CAACAGCACT 
TGATCTTGGG 
TGGTAGCCAT 



AAATGAATTG 
ACTACGAGCT 
TCTCCACCTA 
CAACACCCCA 
ATTTCCCCAG 
ACTGCCATTA 
TCCATAGCGA 
CCCGCCTCCC 
CCTAGGTACG 
CCAACATCCA 
TAATAGAGGA 
TATGCCTTTT 
GCACAAACTT 
ACCCTGAGAT 
ATGCCAAAGC 
CTTCTTATGG 
TTTCACTCAC 
CTCCAGGGCC 
TCTCGGTGCT 
GTCCATGCTC 
AAGCAGAAGG 
AAAGCCTCTG 
GAGGGTTCAC 
ATGCTTGAAT 
GAACTGTCCG 
AGACGGAGTG 
GGCATCCTCT 
CCCGGAAGAT 
GTTTGCTACC 
TACAAAGAGA 
GCTCATCTTT 
AAGGTGCTGG 
GAGCTCTTGG 
TCCTTTCAGC 
GACAGAGGGC 
TTTCCTCTTC 
AAAACAGCGT 
CAAGCACTTC 



CAGAAGGAGC 
TGCTGGTGTT 
CACTGCAAGA 
TATGTCAGTG 
CCACAGGCCA 
CCCAGTGCCG 
CACCCTTTCT 
TAGTTAGCAT 
AAGAGGGGTG 
GTCCTGCTGA 
GCTGCTTGTT 
CTTTTCTGTT 
GTGAGCATCA 
CTGCACAGTC 
CATTCTCCCC 
TGGTTGGTTG 
ATAGCCAACT 
CCCCAACTGT 
CCATCTCCGC 
CA GGGTCAGG 
AGAGTGAGTG 
TCCGCCTTTA 
ACAGCCCAGT 
GGAGCTGGTG 
TGAATGGTCT 
TGGGAGTGAT 
TGAATAGGAA 
CAGATGCAAT 
CTTAGTATTT 
TGCTTGGAGC 
CTGCCCGGCT 
ACTGTTACTG 
GTCAGAGAAA 
CACATGCCCC 
ACATGAACAG 
TAGATGGACC 
CTTCCTTCTT 
GGAATT 



2950 
3000 
3050 
3100 
3150 
3200 
3250 
3300 
3350 
3400 
3450 
3500 
3550 
3600 
3650 
3700 
3750 
3800 
3850 
3900 
4W 

* 4200 ~ 

-4^tr 

,-445 0 " 

~*-&trrr 

-4*4-6- 



H-oro 

ffOO 

(tiro 

mi 
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(2) INFORMATION FOR SEQ ID NO:6 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 924 

(B) TYPE: protein 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: no 
(vi) ORIGINAL SOURCE: 
(A) ORGANISM human 

(ix) FEATURE: Huntingtin-interacting protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0.6 
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Met Ser Arg Met Trp Gly His Leu Ser Glu Gly Tyr Glv Gin Leu 
1 5 10 * 15 

Cys Ser He Tyr Leu Lys Leu Leu Arg Thr Lys Met Glu Tyr His 

20 25 30 

Thr Lys Asn Pro Arg Phe Pro Gly Asn Leu Gin Met Ser Asp Arg 

35 40 45 

Gin Leu Asp Glu Ala Gly Glu Ser Asp Val Asn Asn Phe Phe Gin 

50 55 60 

Leu Thr Val Glu Met Phe Asp Tyr Leu Glu Cys Glu Leu Asn Leu 

65 70 75 

Phe Gin Thr Val Phe Asn Ser Leu Asp Met Ser Arg Ser Val Ser 

80 85 90 

Val Thr Ala Ala Gly Gin Cys Arg Leu Ala Pro Leu He Gin Val 

95 100 105 

He Leu Asp Cys Ser His Leu Tyr Asp Tyr Thr Val Lys Leu Leu 
HO 115 120 

Phe Lys Leu His Ser Cys Leu Pro Ala Asp Thr Leu Gin Gly His 
125 130 135 

Arg Asp Arg Phe Met Glu Gin Phe Thr Lys Leu Lys Asp Leu Phe 
140 145 150 

Tyr Arg Ser Ser Asn Leu Gin Tyr Phe Lys Arg Leu lie Gin He 
155 160 165 

Pro Gin Leu Pro Glu Asn Pro Pro Asn Phe Leu Arg Ala Ser Ala 
170 175 180 

Leu Ser Glu His He Ser Pro Val Val Val He Pro Ala Glu Ala 
185 190 195 

Ser Ser Pro Asp Ser Glu Pro Val Leu Glu Lys Asp Asp Leu Met 
200 205 210 

Asp Met Asp Ala Ser Gin Gin Asn Leu Phe Asp Asn Lys Phe Asp 
215 220 225 

Asp He Phe Gly Ser Ser Phe Ser Ser Asp Pro Phe Asn Phe Asn 
230 235 240 

Ser Gin Asn Gly Val Asn Lys Asp Glu Lys Asp His Leu He Glu 
245 250 255 

Arg Leu Tyr Arg Glu He Ser Gly Leu Lys Ala Gin Leu Glu Asn 
260 265 270 
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Met Lys Thr Glu Ser Gin Arg Val Val Leu Gin Leu Lys Gly His 
275 280 285 

Val Ser Glu Leu Glu Ala Asp Leu Ala Glu Gin Gin His Leu Arg 
290 295 300 

Gin Gin Ala Ala Asp Asp Cys Glu Phe Leu Arg Ala Glu Leu Asp 
305 310 315 

Glu Leu Arg Arg Gin Arg Glu Asp Thr Glu Lys Ala Gin Arg Ser 
320 325 330 

Leu Ser Glu lie Glu Arg Lys Ala Gin Ala Asn Glu Gin Arg Tyr 
335 340 345 

Ser Lys Leu Lys Glu Lys Tyr Ser Glu Leu Val Gin Asn His Ala 
350 355 360 

Asp Leu Leu Arg Lys Asn Ala Glu Val Thr Lys Gin Val Ser Met 
365 370 375 

Ala Arg Gin Ala Gin Val Asp Leu Glu Arg Glu Lys Lys Glu Leu 
380 385 390 

Glu Asp Ser Leu Glu Arg lie Ser Asp Gin Gly Gin Arg Lys Thr 
395 400 405 

Gin Glu Gin Leu Glu Val Leu Glu Ser Leu Lys Gin Glu Leu Gly 
410 415 420 

Thr Ser Gin Arg Glu Leu Gin Val Leu Gin Gly Ser Leu Glu Thr 
425 430 435 

Ser Ala Gin Ser Glu Ala Asn Trp Ala Ala Glu Phe Ala Glu Leu 
440 445 450 

Glu Lys Glu Arg Asp Ser Leu Val Ser Gly Ala Ala His Arg Glu 
455 460 465 

Glu Glu Leu Ser Ala Leu Arg Lys Glu Leu Gin Asp Thr Gin Leu 
470 475 480 

Lys Leu Ala Ser Thr Glu Glu Ser Met Cys Gin Leu Ala Lys Asp 
485 490 495 

Gin Arg Lys Met Leu Leu Val Gly Ser Arg Lys Ala Ala Glu Gin 
500 505 510 

Val lie Gin Asp Ala Leu Asn Gin Leu Glu Glu Pro Pro Leu lie 
515 520 525 

Ser Cys Ala Gly Ser Ala Asp His Leu Leu Ser Thr Val Thr Ser 
530 535 540 
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He Ser Ser Cys He Glu Gin Leu Glu Lys Ser Trp Ser Gin Tyr 
545 550 555 

Leu Ala Cys Pro Glu Asp He Ser Gly Leu Leu His Ser He Thr 
560 565 570 

Leu Leu Ala His Leu Thr Ser Asp Ala He Ala His Gly Ala Thr 
575 580 585 

Thr Cys Leu Arg Ala Pro Pro Glu Pro Ala Asp Ser Leu Thr Glu 
590 595 600 

Ala Cys Lys Gin Tyr Gly Arg Glu Thr Leu Ala Tyr Leu Ala Ser 
605 610 615 

Leu Glu Glu Glu Gly Ser Leu Glu Asn Ala Asp Ser Thr Ala Met 
620 625 630 

Arg Asn Cys Leu Ser Lys He Lys Ala He Gly Glu Glu Leu Leu 
635 640 645 

Pro Arg Gly Leu Asp He Lys Gin Glu Glu Leu Gly Asp Leu Val 
650 655 660 

Asp Lys Glu Met Ala Ala Thr Ser Ala Ala He Glu Thr Cys Thr 
665 670 675 

Ala Arg lie Glu Glu Met Leu Ser Lys Ser Arg Ala Gly Asp Thr 
680 685 690 

Gly Val Lys Leu Glu Val Asn Glu Arg He Leu Arg Cys Cys Thr 
695 700 705 

Ser Leu Met Gin Ala He Gin Val Leu He Val Ala Ser Lys Asp 
710 715 720 

Leu Gin Arg Glu He Val Glu Ser Gly Arg Gly Thr Ala Ser Pro 
725 730 735 

Lys Glu Phe Tyr Ala Lys Asn Ser Arg Trp Thr Glu Gly Leu He 
740 745 750 

Ser Ala Ser Lys Ala Val Gly Trp Gly Ala Thr Val Met Val Asp 

^rr ^rf>o 

Ala Ala Asp Leu Val Val Gin Gly Arg Gly Lys Phe Glu Glu Leu 
J2^r -^e— 

^t) T?r 7 go 

Met Val Cys Ser His Glu He Ala Ala Ser Thr Ala Gin Leu Val 
-3J$*r 

Ala Ala Ser Lys Val Lys Ala Asp Lys Asp Ser Pro Asn Leu Ala 

for f f o 
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Gin Leu Gin Gin Ala Ser Arg Gly Val Asn Gin Ala Thr Ala Gly 
J&2*T 

?ir <pZO <f ^ 

Val Val Ala Ser Thr He Ser Gly Lys Ser Gin lie Glu Glu Thr 

Asp Asn Met Asp Phe Ser Ser Met Thr Leu Thr Gin He Lys Arg 

$4T ftf> ?rr 

Gin Glu Met Asp Ser Gin Val Arg Val Leu Glu Leu Glu Asn Glu 

-7Z ~°tTr 7£ 

Leu Gin Lys Glu Arg Gin Lys Leu Gly Glu Leu Arg Lys Lys His 

-Tr -7To -Iffr 

Tyr Glu Leu Ala Gly Val Ala Glu Gly Trp Glu Glu Gly Thr Glu 

.^S- -9ttr- 

890 F?r 9°° 

Ala Ser Pro Pro Thr Leu Gin Glu Val Val Thr Glu Lys Glu 

for 9/o 
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CLAIMS 

1 1 A cDN A molecule comprising the sequence given by Seq ID No. L 

1 2. A cDNA molecule comprising the sequence given by Seq. ID No. 5. 

1 3. A polypeptide comprising the sequence given by Seq. ID No. 2. 

1 4. A polypeptide comprising the sequence given by Seq. ID. No. 6. 

1 5. A chimeric gene or plasmid comprising at least nucleotides 3 14 to 1955 

2 of the Huntington's Disease gene and an activating or DNA binding domain suitable for use in 

3 a yeast multi-hybrid assay. 

1 6. The chimeric gene or plasmid according to claim 5, wherein the 

2 Huntington's Disease gene encodes a polyglutamine tract having a length of 35 or fewer 

3 residues. 

1 7. The chimeric gene or plasmid according to claim 5, wherein the 

2 Huntington's Disease gene encodes a polyglutamine tract having a length of 36 or more 

3 residues 

1 8. A method for ameliorating the effects of Huntington's disease in a 

2 patient expressing Huntingtin protein with an expanded CAG repeat region, comprising the 

3 step of increasing the amount of an expressed HD-interacting polypeptide in the brain of the 

4 patient, wherein the expressed HD-interacting polypeptide interacts less well with expanded 

5 Huntingtin than with Huntingtin having a CAG repeat region containing 1 5 to 35 repeats and 

6 facilitates the incorporation of Huntingtin into brain cell membranes 

1 9. The method according to claim 8, wherein the expressed HD- 

2 interacting polypeptide comprises the sequence given by Seq. ID No. 2. 
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1 1 0. An antibody which binds to a polypeptide having the sequence given by 

2 Seq. ID. No. 2. 

1 11. The antibody of claim 10, wherein the antibody binds to amino acids 

2 76-91 of the polypeptide having the sequence shown in Seq. ID No. 2. 

1 12. An expression vector for expression of a gene in a mammalian host 

2 comprising a region encoding an HD-interacting polypeptide, wherein the HD-interacting 

3 polypeptide interacts less well with expanded Huntingtin than with Huntingtin having a CAG 

4 repeat region containing 1 5 to 35 repeats and facilitates the incorporation of Huntingtin into 

5 brain cell membranes. 

1 13. An expression vector for expression of a gene in a mammalian host 

2 comprising a region that is the same as or complementary to Seq. ID NO. 1. 

1 14. An expression vector for expression of a gene in a mammalian host 

2 comprising a region that is the same as or complementary to Seq. ID NO. 5. 

1 15. The expression vector according to claims of claims 12-14, further 

2 comprising a region encoding Huntingtin having a polyglutamine tract of 35 or fewer, 

1 1 6. An oligonucleotide probe having a length of from 1 5-40 bases which 

2 specifically and selectively hybridizes with the cDNA given by Seq. ID No. 1 or a sequence 

3 complementary thereto. 
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